Abstract

We investigate the overlap operator with a UV filtered Wilson kernel. The filtering leads to a better
localization of the operator even on coarse lattices and with the untuned choice $\rho = 1$. Furthermore,
the axial-vector renormalization constant $Z_A$ is much closer to 1, reducing the mismatch with
perturbation theory. We show that all these features persist over a wide range of couplings and
that the details of filtering prove immaterial. We investigate the properties of the kernel spectrum
and find that the kernel non-normality is reduced. As a side effect we observe that for certain
applications of the filtered overlap a speed-up factor of 2-4 can be achieved.



1 Introduction 



From a theoretical viewpoint the ascent of "overlap" fermions [1, 2, 3], i.e. fermions which at zero quark
mass satisfy the Ginsparg Wilson (GW) relation [4] ($\rho$ is a parameter that will be specified later)

$\gamma_5 D + D \hat\gamma_5 = 0, \qquad \hat\gamma_5 = \gamma_5 (1 - \frac{1}{\rho} D)$ (1)

and thus realize a lattice version of the continuum chiral symmetry [5]

$\delta\psi = \hat\gamma_5 \psi, \qquad \delta\bar\psi = \bar\psi \gamma_5$ (2)

together with an index theorem [6, 7], represents a major breakthrough in the field of non-perturbative
studies of QCD. We know how to discretize fermions in a way that preserves the relevant symmetries:
(i) gauge invariance, (ii) flavor symmetry, and (iii) chiral invariance. Unfortunately, from a practical
viewpoint the usefulness of this concept is limited by the fact that the overlap tends to be one to two 
orders of magnitude more expensive, in terms of CPU time, than a standard Wilson Dirac operator. 

In this paper we study a variant of the overlap operator which makes use of a UV filtered Wilson 
kernel. Here, the "filtering" refers to replacing the original ("thin") links of the gauge configuration 
in the standard definition of the Wilson kernel by "thick" links obtained through APE [8] or HYP [9]
smearing. This is a legal change of discretization as long as one keeps the iteration level and smearing 
parameters fixed all the way down to the continuum, since the "thick" links transform under a local 
gauge transformation in the same way as the "thin" links; it should be seen as a modification of the 
operator and not of the gauge background. Such filtering has been used in the context of staggered 
quarks, where it has been found to reduce UV fluctuations, in particular taste changing interactions 
due to highly virtual gluons [10]. In Ref. [11] filtered staggered quarks were compared against overlap
quarks (where the filtered version was merely considered for completeness), and it was observed that a
single filtering step may speed up the forward application of the overlap operator $D_{\rm ov}$ on a source vector
by a factor 2-4, depending on the gauge background. This was seen to come through a reduction of the
degree of the Chebychev polynomial needed to approximate the inverse square root or sign function in
the definition of the massless overlap [3]

$a D_{\rm ov} = \rho\,[1 + D_{W,-\rho} (D_{W,-\rho}^\dagger D_{W,-\rho})^{-1/2}] = \rho\,[1 + \gamma_5\,{\rm sign}(a \gamma_5 D_{W,-\rho})]$ (3)

with $D_{W,-\rho} = D_W - \rho/a$ the Wilson operator at negative mass $-\rho/a$. However, what matters in view of
most phenomenological applications is the performance of the massive operator (bare quark mass $m$)

$D_{{\rm ov},m} = (1 - \frac{am}{2\rho}) D_{\rm ov} + m$ (4)

in the process of calculating a given physical observable to a pre-defined accuracy. In other words the 
total CPU time spent depends on: 

1. The number of forward applications of the shifted Wilson operator $D_{W,-\rho}$ (or, generally speaking,
of the kernel) needed to construct the massless overlap operator (3).

2. The number of iterations spent on inverting the so-constructed massive operator for a given
renormalized quark mass (or a given $M_\pi$).

3. The number of gauge backgrounds needed to reach a pre-defined statistical accuracy of the desired 
observable at a given lattice spacing a. 

4. The lattice spacing needed to enter the scaling window. 
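The construction behind point 1 can be illustrated with a toy kernel. The following is a minimal sketch, assuming a random hermitean matrix as a stand-in for $H_W$ (no gauge field or lattice structure is involved) and an exact eigendecomposition in place of the polynomial/rational approximation used in practice; the helper names `toy_overlap` and `gw_residual` are hypothetical. It builds $\rho\,[1 + \gamma_5\,{\rm sign}(H)]$ and checks the GW relation in the form $\gamma_5 D + D \hat\gamma_5 = 0$.

```python
import numpy as np

def toy_overlap(n_half=8, rho=1.0, seed=0):
    """Build rho * (1 + gamma5 * sign(H)) for a random hermitean toy kernel H."""
    rng = np.random.default_rng(seed)
    n = 2 * n_half
    # chirality matrix with equal numbers of +1 and -1 entries
    gamma5 = np.diag(np.concatenate([np.ones(n_half), -np.ones(n_half)]))
    a = rng.normal(size=(n, n)) + 1j * rng.normal(size=(n, n))
    h = (a + a.conj().T) / 2                  # hermitean stand-in for H_W (assumption)
    evals, evecs = np.linalg.eigh(h)
    # exact matrix sign function via the eigendecomposition of H
    sign_h = (evecs * np.sign(evals)) @ evecs.conj().T
    d_ov = rho * (np.eye(n) + gamma5 @ sign_h)
    return gamma5, d_ov

def gw_residual(gamma5, d_ov, rho=1.0):
    """Frobenius norm of gamma5 D + D gamma5_hat, gamma5_hat = gamma5 (1 - D/rho)."""
    n = d_ov.shape[0]
    gamma5_hat = gamma5 @ (np.eye(n) - d_ov / rho)
    return np.linalg.norm(gamma5 @ d_ov + d_ov @ gamma5_hat)

gamma5, d_ov = toy_overlap()
print("GW residual:", gw_residual(gamma5, d_ov))
```

The residual vanishes up to rounding for any hermitean kernel without zero modes, which is why the choice of kernel (thin or thick links) is a matter of efficiency and locality rather than of chiral symmetry.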

The main emphasis of this paper will be on point 1; in particular we attempt to give an understanding
of the observed speedup in terms of the spectral properties of the underlying hermitean (shifted) Wilson
operator $H_W = \gamma_5 D_{W,-\rho}$. At first sight it might seem that point 2 does not need to be considered at
all. At fixed bare mass $m$ and fixed $\rho$ the filtered and the unfiltered overlap do not differ on this point,
since the number of forward applications of $D_{{\rm ov},m}$ to get a column of the inverse depends only on its
condition number, and that is $2\rho/m$ for either variety. As we shall see, the optimum $\rho$ (w.r.t. locality)
gets reduced through filtering whereas $Z_m = Z_S^{-1} = Z_P^{-1}$ increases, and this means that in the filtered
case one has to use a smaller bare mass to work at a fixed physical $m^{\rm ren} = Z_m m$. These two aspects
tend to compensate, and as a result there is little net effect on point 2 from filtering. Whether in points
3 and 4 filtering brings further savings is not clear, but we plan to address this issue in the future.

Let us try to obtain a first understanding of the effect of filtering in terms of the spectrum of the 
underlying (non-hermitean) Wilson operator. We are going to compute all eigenvalues, and to avoid 
spending too much CPU time on this illustration, we shall do this in 2D, but it is clear that the 
conceptual issue is - mutatis mutandis - the same as in 4D. In d dimensions the Wilson Dirac operator 
has $d+1$ branches, and the respective flavor multiplicities are

$\binom{d}{k}, \qquad k = 0, \dots, d.$ (5)

Thus in 2D the Wilson operator has 3 branches with multiplicities 1,2,1, while in 4D it has 5 branches
with multiplicities 1,4,6,4,1, respectively. Fig. 1 shows the complete spectrum of $D_W$ with the hopping
parameter fixed at its tree-level critical value, $\kappa = 0.25$, on 10 configurations of size $16^2$ at $\beta = 3.2$ in the
quenched Schwinger model. Besides the "thin" link operator also its UV filtered descendant is shown.
In terms of the kernel spectrum the filtering is seen to have the following effects: 

(a) The two/four "bellies" are depleted - in particular exactly real modes which cannot be assigned
in a unique way to one of the three/five branches are severely suppressed.

(b) The horizontal scatter of any of the three/five branches diminishes.

(c) The additive mass renormalization of the physical (leftmost) branch is substantially reduced.

Figure 1: Eigenvalue spectra of $D_W$ with $\kappa = \kappa_{\rm crit}^{\rm tree} = 0.25$ in the quenched Schwinger model ($16^2$, $\beta = 3.2$,
10 configurations) without filtering and after 3 steps with $\alpha = 0.5$ (i.e. equal weight to the original link
and the staple, see [12] for details; in 2D APE involves already the full hypercube). Filtering depletes
the "bellies", makes the physical (leftmost) branch narrower and shifts it to the left.
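The branch structure quoted above (3 branches with multiplicities 1,2,1 in 2D, 5 branches with 1,4,6,4,1 in 4D) can be checked in the free case. The sketch below is an illustration only, assuming the massless free Wilson operator with unit Wilson parameter, whose real eigenvalues sit at the momentum corners $p_\mu \in \{0, \pi\}$; the function name `free_wilson_real_modes` is hypothetical.

```python
from itertools import product
from collections import Counter
from math import comb, cos, pi

def free_wilson_real_modes(d):
    """Count momentum corners p_mu in {0, pi} by their real free Wilson eigenvalue."""
    counts = Counter()
    for corner in product((0.0, pi), repeat=d):
        # at a corner all sin(p_mu) vanish, so the eigenvalue is sum_mu (1 - cos p_mu)
        value = round(sum(1.0 - cos(p) for p in corner))
        counts[value] += 1
    return dict(sorted(counts.items()))

for d in (2, 4):
    branches = free_wilson_real_modes(d)
    print(f"d = {d}: real eigenvalue -> multiplicity {branches}")
    # the multiplicities are the binomial coefficients (d choose k)
    assert list(branches.values()) == [comb(d, k) for k in range(d + 1)]
```

In the interacting case the branches are smeared out horizontally, which is the scatter that filtering is observed to reduce.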

If one were to ignore the kernel non-normality (we shall come back to this point), the spectrum of
$D_W$ could be linked, on a mode-by-mode basis, to the one of $H_W = \gamma_5 D_{W,-\rho}$ and $D_{\rm ov}$. Then
the first observation above (the reluctance of the filtered eigenvalues to show up near the projection
point $\rho$) simply means that the effect of filtering on the spectrum of $H_W$ is to deplete the vicinity of
the origin by pushing the eigenvalues further towards the ends of the interval $[-2d+1, 2d-1]$. In spite
of the caveat mentioned, the thinning effect that (any kind of) smearing has on the spectrum of $H_W$
near zero is indeed the reason for the speedup in point 1 above. A bigger interval $[0, \epsilon[$ or $]-\epsilon, \epsilon[$ that
does not need to be covered by the polynomial/rational approximation to the $1/\sqrt{\cdot}$ or sign(.) function
translates into a lower degree and thus into fewer forward applications of the kernel operator.

In the remainder of this article we shall address the spectral properties of $H_W$ in more detail (Sec. 2),
and show that (a reasonable amount of) filtering does not degrade the locality properties of $D_{\rm ov}$, but
rather makes the overlap operator more local (Sec. 3). We continue with an explicit demonstration
that the kernel non-normality gets reduced by filtering (Sec. 4). We add some observations relevant to
phenomenological applications of the filtered overlap; in particular $Z_A$ is shown to be much closer to the
tree-level value 1 than for the unfiltered variety (Sec. 5). We rate this as a sign that perturbation theory
might work far better for the filtered overlap. We make an attempt to compare our simple filtering
recipe against other approaches (Sec. 6). Finally, the appendix contains spectral data which suggest
that the spectral density of $H_W$ at the origin is non-zero for any $\beta$ and any filtering level.

We shall use pure gauge backgrounds and set the scale through the Sommer parameter $r_0$ [13]. We
choose the Wilson gauge action, and since $r_0(\beta)$ is known [13] it is easy to select $\beta$ values such that the
resulting lattices are matched, i.e. have fixed spatial size $L \simeq 1.5$ fm, with the resolution varying by a
factor 3 from the coarsest to the finest lattice - see Tab. 1 for details. Henceforth we set $a = 1$.

$\beta$       5.66             5.84              6.00              6.26
geometry      $8^4$            $12^4$            $16^4$            $24^4$
geometry      $8^3 \times 16$  $12^3 \times 24$  $16^3 \times 32$

Table 1: Survey of matched 4D couplings and geometries with fixed $L/r_0 \simeq 3$, according to the interpolation
formula of Ref. [13]. The first coupling is slightly out of bound (see discussion in [14]).



2 Speedup and kernel spectrum 

In a quenched simulation the overhead, in terms of CPU time, of overlap versus Wilson quarks comes
in the first place from the polynomial or rational approximation to the $1/\sqrt{\cdot}$ or sign(.) function in (3).
Let us assume^1 that the lowest eigenvalue of the unfiltered $|H_W|$ is 0.14 while the highest eigenvalue
takes the free field value 7. This leads to the task to construct a polynomial/rational approximation
of the inverse square root over the range $[\epsilon^2, 1]$ with $\epsilon = 0.02$ the inverse condition number of $|H_W|$.
Modest filtering will lift the lowest eigenvalue to something like 0.49, while the largest eigenvalue is
almost invariant. Then the task is to construct the approximation over the range $[\epsilon^2, 1]$ with $\epsilon = 0.07$
the filtered inverse condition number. The lower bound increasing from 0.0004 to 0.0049 means that one
gets away with a smaller overall polynomial degree or an increased minimum root of the denominator
polynomial. Therefore, the filtered overlap requires fewer forward applications of $D_{W,-\rho}$ and this
is how the savings on CPU time in point 1 above come about. In the remainder of this section we will
elaborate on this statement, replace the fictitious numbers by actual figures from real simulations and
see that the conclusion remains unchanged.
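The arithmetic of this illustrative example is easily reproduced. The snippet below is a small sketch that uses only the fictitious eigenvalues quoted above (0.14 and 0.49 for the lower edge, 7 for the upper edge); the function name `approximation_range` is hypothetical.

```python
def approximation_range(lowest, highest):
    """Return (epsilon, epsilon**2) for |H_W| eigenvalues in [lowest, highest]."""
    eps = lowest / highest            # inverse condition number of |H_W|
    return eps, eps ** 2              # the inverse square root is needed on [eps^2, 1]

for label, lowest in (("unfiltered", 0.14), ("filtered", 0.49)):
    eps, lower_bound = approximation_range(lowest, 7.0)
    print(f"{label}: epsilon = {eps:.2f}, approximation range = [{lower_bound:.4f}, 1]")
```

The lower end of the approximation range grows with the square of the lifting factor, here by $(0.49/0.14)^2 = 12.25$.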

Fig. 2 shows, as an illustration, the 15 lowest eigenvalues [15] of $|H_W|$ on 25 configurations at $\beta = 6.0$,
without filtering and after 1,3 steps of APE or HYP smoothing. The filtering increases the upper end
of the band of eigenvalues shown. In fact, just this upper end matters in terms of CPU time, since in
practice one projects out the lowest few modes [16] and constructs the function in (3) over the relevant
spectral range of $|H_W|$ on the subspace orthogonal to these modes. Hence the sequence of the 15th
eigenvalue represents the relevant quantity, if 14 modes are treated exactly, and this band gets lifted
by filtering. Evidently, a single APE step is less efficient than a single HYP step, and adding two more
steps lifts the 15th eigenvalue further, but the lifting factor is no longer as large as it was in the first
step. Here and below we use the parameters $\alpha_{\rm APE} = 0.5$, $\alpha_{\rm HYP} = (0.75, 0.6, 0.3)$ (for details of the
$SU(3)$ projection see e.g. [17] or the appendix of [11]) and, unless stated otherwise, $\rho = 1$.

^1 In fact, these values are rather close to the actual situation at $\beta \simeq 6.0$, after projecting out the lowest 10-15 eigenvectors.

Figure 2: Sequence of the lowest 15 eigenvalues of $|H_W|$ on 25 configurations at $\beta = 6.0$ without filtering
and after 1,3 steps of APE (left) or HYP (right) filtering. Throughout $\rho = 1$.

Figure 3: Mean and standard deviation ($\beta = 5.66, 5.84, 6.00, 6.26$, from top to bottom) of the 15 lowest
eigenvalues of $|H_W|$ at $\rho = 1$ in semi-logarithmic form with 0,1,3 steps of APE or HYP filtering.

Table 2: Start of the "bulk" part of the eigenvalue spectrum of the shifted hermitean Wilson operator
$|H_W|$ without filtering and after one or three APE or HYP steps. We use the mean of the 15th-smallest
eigenvalue to define the "bulk" edge. Unless indicated otherwise, the numbers refer to the case $\rho = 1$.

Fig. 3 shows the mean and the standard deviation of the 15 lowest eigenvalues of $|H_W|$, with our
standard filtering options (none, 1 APE, 3 APE, 1 HYP, 3 HYP). In this logarithmic representation it
is easy to see that (apart from the coarsest lattice which represents a special case discussed in App. A)
all 15 eigenvalues get lifted, at a given coupling, by virtually the same factor. Specifically, the 15th^2
eigenvalue gets multiplied by $\lambda_{\rm 1HYP}/\lambda_{\rm none} = 4.8, 4.2, 3.2$ at $\beta = 5.84, 6.00, 6.26$. Thus the lifting effect that
filtering has on the "bulk" part of the $|H_W|$ spectrum diminishes somewhat towards the continuum, but
for accessible couplings it remains substantial. Details of the ensemble average of the 15th eigenvalue
are collected in Tab. 2. The second observation is that the bands become flatter at large $\beta$, hence the
onset of the "bulk" becomes a less ambiguous concept at weaker coupling. Had we chosen the 10th or
20th mode to define the "bulk edge" instead of the 15th, this would cause a small change at $\beta = 6.26$,
but it would make a substantial difference at the smallest $\beta$ shown.

A point of theoretical interest is whether the low-lying eigenvalues of the (shifted) hermitean Wilson
operator are correlated, between different smearing levels, just as the low-lying eigenvalues of the
final were found to be correlated for large enough $\beta$ [11]. Fig. 4 shows that this is almost true - the
eigenvalues correlate if they are sufficiently large in absolute magnitude, but the correlation weakens
closer to the origin. Here, a technical issue comes along. Ideally, one would pair the eigenvalues by
considering a smooth interpolation between the two filtering recipes. Changes in topology (as seen by
the overlap operator) would then be evident as stray points in quadrants 2 or 4. However, since we
just know the eigenvalues shown we decided to pair them starting from 0. Now there are no points
in quadrants 2 or 4 by definition and changes in topology manifest themselves through a reduced
correlation of the few lowest eigenvalues in absolute magnitude. Such topology changes are expected to
occur with an $O(a^2)$ re-definition of the overlap operator, e.g. by changing the filtering or $\rho$ [11].

^2 This number needs to be scaled with the physical box volume; working, for any given $\beta$, in a $(2.0\,{\rm fm})^4$ box instead of
$(1.5\,{\rm fm})^4$, our statement would most likely be adequate for the 47th mode.

Figure 5: Eigenvalue flows of $H_W$ with three filterings (none, 1 APE, 1 HYP) on one $16^4$ configuration.

A similar conclusion is drawn from the flow of eigenvalues of $H_W$ as shown in Fig. 5 for one $16^4$
configuration. One effect of filtering is to stretch the whole scenery in the vertical direction (note the
vertical scale). Filtering also shifts the entire eigenvalue flow to the left which is consistent with the
reduction of the additive mass renormalization of the kernel operator as discussed in the introduction.
Note that there is, from a conceptual viewpoint, no reason to prefer one filtering level over any other
one; what we see is just a manifestation of the $O(a^2)$ ambiguity of the overlap operator [11].

To assess the CPU time needed for the massless overlap, the behavior of the "bulk edge" of the $|H_W|$
spectrum is one ingredient. What really matters is the condition number, thus we need to study the
largest eigenvalue, too. From the naive discussion around Fig. 1 in the introduction one expects that
filtering barely affects the largest eigenvalue of $|H_W|$. It turns out that this is indeed true, for instance
at $\beta = 6.0$ a single HYP filtering step lifts it from 6.55(1) to 6.88(1). Hence, filtering has an overall
beneficial effect on the condition number as illustrated in Fig. 6. Without projection the condition
number fluctuates wildly and occasionally it may increase through filtering (i.e. the lowest eigenvalue
decreases, cf. Fig. 2) but after projecting 14 eigenmodes this never occurs. The bottom line is that the
combination of filtering and projection reduces the condition number much more vigorously than either
one alone could do. Average condition numbers after projecting out 14 eigenmodes are collected in
Tab. 3 (regarding the first entry, cf. App. A). As a side remark we note that the horizontal increase to
the left explains why in a fixed physical volume simulating unfiltered overlap quarks on a coarse lattice
is not so much cheaper than on a fine one; for the filtered version this penalty is reduced.

Figure 6: Condition number of $|H_W|$ on 25 configurations at $\beta = 6.0$ without projection (left) and with
14 modes handled exactly (right), after 0,1,3 steps of APE or HYP filtering.

Table 3: Mean condition number $1/\epsilon$ of $|H_W|$ at $\rho = 1$ after projecting the 14 lowest eigenmodes.

We have also studied the condition number of $|H_W|$ as a function of the parameter $\rho$. With and
without filtering the minimum is rather shallow and at a $\rho$ value above 1. Since in the free case

$\epsilon = \begin{cases} \rho/(8-\rho) & \text{for } 0 < \rho < 1 \\ (2-\rho)/(8-\rho) & \text{for } 1 < \rho < 2 \end{cases}$ (6)

we expect that larger $\beta$ values will further drive the minimum location towards $\rho = 1$.
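The free-case formula can be cross-checked by brute force. The following sketch assumes the standard momentum-space form of the free 4D Wilson operator with unit Wilson parameter, for which the eigenvalues of $H_W^2$ are $\sum_\mu \sin^2 p_\mu + (\sum_\mu (1-\cos p_\mu) - \rho)^2$; the lattice size `n = 8` and the function names are arbitrary choices made for this illustration.

```python
import numpy as np

def free_inverse_condition_number(rho, n=8):
    """Brute-force min/max of |H_W| over the momenta of a periodic n^4 lattice."""
    p = 2.0 * np.pi * np.arange(n) / n
    grids = np.meshgrid(p, p, p, p, indexing="ij")
    sin2 = sum(np.sin(g) ** 2 for g in grids)
    wilson = sum(1.0 - np.cos(g) for g in grids) - rho
    h_abs = np.sqrt(sin2 + wilson ** 2)      # |eigenvalue| of gamma5 (D_W - rho)
    return h_abs.min() / h_abs.max()

def free_formula(rho):
    """Piecewise expression for the free inverse condition number, 0 < rho < 2."""
    return rho / (8.0 - rho) if rho < 1.0 else (2.0 - rho) / (8.0 - rho)

for rho in (0.4, 0.8, 1.0, 1.4, 1.8):
    print(rho, free_inverse_condition_number(rho), free_formula(rho))
```

For even `n` the extremal momenta (the corners with $p_\mu \in \{0, \pi\}$) are part of the grid, so the brute-force value and the formula should agree to rounding, with the maximum of the inverse condition number at $\rho = 1$.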

The last step is to convert the reduced condition number, brought by the filtering, of $|H_W|$ on the
subspace orthogonal to the lowest 14 modes into a lower degree of the polynomial/rational approximation
of the $1/\sqrt{\cdot}$ function in (3) and thus into actual savings of CPU time in step 1 of the introduction.
The precise speedup factor depends on the implementation of the massless overlap operator (3). For
definiteness let us consider the approximation of the inverse square root over the range $[\epsilon^2, 1]$ through
Chebychev polynomials [16]. Fig. 7 shows on the l.h.s. for a few inverse condition numbers $\epsilon$ of $|H_W|$ the
well known exponential fall-off pattern of the truncation error of the Chebychev approximation versus
the number of applications of $H_W^2 = D_{W,-\rho}^\dagger D_{W,-\rho}$. What matters for our purpose is the dependence
of the polynomial degree required to reach a fixed minimax accuracy - say $\delta$ = 10~^ over the full
approximation range - on $\epsilon$. As is evident from the r.h.s. of that figure, the relation

degree $\propto \epsilon^{-1}$ (7)

holds in good approximation. Thus, from (7) and a look at Tab. 3 one predicts that at $\beta = 6.0$ and $\rho = 1$
a single HYP step will speed up the construction of the overlap (on average) by a factor 53.2/13.3 = 4.00,
and this is in good agreement with what we find in actual runs (see Fig. 8). On a coarser lattice this
factor would be somewhat larger (4.56 at $\beta = 5.84$) while on a finer lattice it tends to decrease (3.04 at
$\beta = 6.26$), but it certainly remains substantial at all accessible couplings.

Figure 8: Mean and standard deviation of the Chebychev polynomial degree used to achieve a minimax
accuracy of 10~^. At $\beta = 6.0$ and $\rho = 1$, a single HYP step results in a speedup by a factor ~4 for the
massless overlap operator. Comparing to the situation with $\rho = 1.4$ and no filtering the factor is ~2.
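The scaling of the polynomial degree with the inverse condition number in (7) can be explored numerically. The sketch below is a rough illustration under stated assumptions: it uses interpolation at Chebychev nodes (via `numpy.polynomial.Chebyshev.interpolate`) as a stand-in for a true minimax approximation, measures the relative error of the inverse square root on $[\epsilon^2, 1]$, and adopts an arbitrary target accuracy of $10^{-6}$; none of these choices is taken from the runs discussed in the text.

```python
import numpy as np
from numpy.polynomial import Chebyshev

def max_rel_error(eps, degree, n_test=4000):
    """Max of |sqrt(x) p(x) - 1| on [eps^2, 1] for a degree-`degree` interpolant p."""
    lo = eps ** 2
    poly = Chebyshev.interpolate(lambda x: 1.0 / np.sqrt(x), degree, domain=[lo, 1.0])
    x = np.geomspace(lo, 1.0, n_test)        # dense near the lower end of the range
    return np.max(np.abs(np.sqrt(x) * poly(x) - 1.0))

def min_degree(eps, delta=1e-6):
    """Smallest degree (found by doubling plus bisection) with error below delta."""
    hi = 8
    while max_rel_error(eps, hi) > delta:
        hi *= 2
    lo = hi // 2
    while hi - lo > 1:
        mid = (lo + hi) // 2
        if max_rel_error(eps, mid) > delta:
            lo = mid
        else:
            hi = mid
    return hi

deg_unfiltered = min_degree(0.02)   # illustrative unfiltered inverse condition number
deg_filtered = min_degree(0.07)     # illustrative filtered inverse condition number
print(deg_unfiltered, deg_filtered, deg_unfiltered / deg_filtered)
```

With the illustrative values $\epsilon = 0.02$ and $0.07$ the ratio of the two degrees is expected to come out close to $0.07/0.02 = 3.5$, in line with the approximately inverse-linear dependence of the degree on $\epsilon$.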

To approximate the inverse square root or sign function over the relevant range, two main strategies 
are found in the literature. Polynomial 1201 QB] and rational pUl 1^ representations have been tried. 
We have concentrated on the Chebychev variant, since this one is efficient and easy to implement. It 
goes without saying that the lifting effect on the bulk of the $|\gamma_5 D_{\rm kern}|$ eigenvalues translates into similar
savings on CPU time in step 1 of the introduction, if another representation is used. For instance, in
the rational approach it is the increase of the smallest zero of the denominator polynomial that lets one
get away with fewer iterations in the inner multishift CG.

For five dimensional variants of the overlap operator [221 12H1 IMl 1211 1201; in particular the domain 
wall formulation, the computational gain comes from the reduction of the extent of the fifth dimension 
needed to reach a given residual mass. What we would like to stress here is simply that our proposal 
to replace the "thin" links by "thick" links is generically useful for any kind of overlap variant. 



3 Locality 

It has been shown [27] that the overlap operator cannot be ultralocal, as opposed to the Wilson operator
where $D_W(x,y) = 0$ for $\|x-y\|_1 > 1$. To guarantee the universality of the underlying field theory and
hence to obtain the correct continuum limit it is sufficient to have an operator with

$D_{\rm ov}(x,y) \propto \exp(-\nu \|x-y\|)$ for $\|x-y\| \gg 1$ (8)

where the localization $\nu$ is of the order of the cut-off, i.e. $\nu = O(1)$ [in lattice units]. In practice, for a
given lattice spacing the condition (8) gives an upper bound on any physical mass that one can extract,
and it is therefore crucial to have an operator as local as possible, i.e. with a maximal $\nu$. In [28] it has
been demonstrated that the standard overlap operator indeed obeys (8). It is clear that their proof goes
through for our filtered variant, but it is open in which way the localization $\nu$ is influenced. Naively, one
might think that the locality will deteriorate, since the original links entering the covariant derivative
of the filtered kernel spread over a larger volume. As first observed by Kovacs [29], the filtered overlap
turns out to be even more local than the standard one and this is achieved without tuning $\rho$.

Figure 9: Localization of the overlap at $\beta = 6.0$ without filtering and after 1 HYP step, for $\rho = 1.0$ and
$\rho = 1.4$. A single HYP step proves more efficient than optimizing $\rho$. Filtering and $\rho > 1$ should not be
combined; for the filtered operator the untuned choice $\rho = 1$ is reasonable (but still not optimal).

In Fig. 9 we plot the localization of $D_{\rm ov}$ at $\beta = 6.0$ with two projection parameters ($\rho = 1.0, 1.4$) and
two filtering options (none, 1 HYP). The ordinate is the maximum over the 2-norm of $D_{\rm ov}\eta$ at $x$ with $\eta$
a normalized $\delta$-peak source vector at the point $y$ in the lattice, the abscissa is the "taxi driver" distance
$d_1 = \|x-y\|_1$ to the location of the $\delta$-peak, i.e. we plot the function

$f(d_1) = \sup\{\, \|(D_{\rm ov}\eta)(x)\|_2 \;:\; \|x-y\|_1 = d_1 \,\}$ (9)

versus $d_1$, as first studied in [28]. Comparing the two unfiltered operators (black/dark diamonds and
crosses) one finds their result reproduced that (at this $\beta$) adjusting $\rho$ to a value around 1.4 lets $f(d_1)$
fall off steeper than with the value 1.0 which is the canonical choice in view of the spectrum of the
Wilson operator sufficiently close to the continuum (cf. Fig. 1). The interesting observation is that
a single HYP step together with $\rho = 1.0$ (red/light squares) results in an even steeper descent than
the unfiltered version with $\rho = 1.4$ (which was chosen to nearly optimize the locality of the unfiltered
operator). The last curve shown (red/light pluses) indicates that one should not attempt to combine
the filtering with a $\rho$ value that would be optimal for the unfiltered operator.
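The binning behind the function in (9) can be sketched as follows. This is a minimal illustration, assuming that the site-wise 2-norms of the operator applied to the source have already been computed and stored in an array on a periodic lattice; the function name `localization_profile` and the exponentially decaying toy field are hypothetical and not taken from the simulations.

```python
import numpy as np

def localization_profile(norms, source):
    """f(d1): maximum of `norms` over all sites at taxi-driver distance d1 from `source`."""
    shape = norms.shape
    axes = np.meshgrid(*[np.arange(n) for n in shape], indexing="ij")
    d1 = np.zeros(shape, dtype=int)
    for coords, n, y in zip(axes, shape, source):
        delta = np.abs(coords - y)
        d1 += np.minimum(delta, n - delta)    # periodic distance in this direction
    return {int(d): float(norms[d1 == d].max()) for d in np.unique(d1)}

# toy data: exponentially decaying field on a periodic 8^2 lattice (not real overlap data)
n, nu = 8, 0.7
x, y = np.meshgrid(np.arange(n), np.arange(n), indexing="ij")
dist = np.minimum(x, n - x) + np.minimum(y, n - y)
profile = localization_profile(np.exp(-nu * dist), source=(0, 0))
for d, value in profile.items():
    print(d, value)
```

A localization can then be read off from the slope of $\log f(d_1)$ over a suitable range of distances; for the toy field this slope is $-0.7$ by construction.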

An obvious question is whether filtering remains useful on fine lattices. Fig. 10 shows the fall-off
at four couplings with no smearing, 1 APE and 1 HYP step, with $\rho = 1$ fixed. On the coarsest lattice
smearing alters the locality just modestly, on the two intermediate ones ($\beta = 5.84, 6.00$) the locality gets
substantially improved, with HYP doing a better job than APE. On the finest lattice, the improvement
is still sizable, but there is almost no difference between the two filtering recipes. At this coupling further
smearing steps would then diminish the locality. The localization measured with the definition (29)
[which we use for technical reasons discussed below] is summarized in Tab. 4.

Figure 10: Localization of the unfiltered overlap and of the versions with one APE or HYP step. The
data are for the matched ensembles ($8^4$ at $\beta = 5.66$, $12^4$ at $\beta = 5.84$, $16^4$ at $\beta = 6.00$, $24^4$ at $\beta = 6.26$)
and $\rho = 1$ throughout. On sufficiently fine lattices the choice of smearing proves irrelevant.

$\beta$    5.66        5.84        6.00        6.00 ($\rho = 1.4$)    6.26
none       0.330(18)   0.236(17)   0.308(09)   0.571(10)              0.370(07)
1 APE      0.344(30)   0.447(29)   0.577(13)   0.543(06)              0.586(18)
3 APE      0.429(49)   0.634(30)   0.682(11)   0.485(04)              0.549(09)
1 HYP      0.469(48)   0.642(32)   0.695(10)   0.480(05)              0.554(05)
3 HYP      0.610(12)   0.630(32)   0.585(03)   0.476(08)              0.519(02)

Table 4: Localization $\nu$ of the overlap operator with an unfiltered Wilson kernel and after 1 or 3 steps
of APE or HYP filtering. At $\beta = 6.0$ we compare to $\rho = 1.4$ which is nearly optimal without filtering.
We use the definition (29); the error is only statistical.

There is a loose connection between the localization of $D_{\rm ov}$ and the spectrum of $H_W$, for instance

$\|D_{\rm ov}(x,y)\| \le {\rm const} \times \exp(-\frac{\theta}{2} \|x-y\|_1)$ (10)

is a bound found in [28], where $\|.\|$ is the matrix norm in Dirac and color space. The exponent $\theta/2$ in
(10) is defined via the largest and smallest eigenvalue of $D_{W,-\rho}^\dagger D_{W,-\rho}$ through

$\cosh(\theta) = \frac{\lambda_{\max}/\lambda_{\min} + 1}{\lambda_{\max}/\lambda_{\min} - 1} = \frac{1 + \epsilon^2}{1 - \epsilon^2}$ (11)

where we like to express the r.h.s. in terms of the inverse condition number $\epsilon$ of $|H_W|$. Expanding either
side to first order one obtains the simple relation (after getting rid of the unphysical $\theta < 0$ solution)

$\frac{\theta}{2} = \epsilon + O(\epsilon^3)$. (12)

Figure 11: Localization $\nu$ vs. $\rho$ at $\beta = 5.84$ (left) and $\beta = 6.00$ (right) with no filtering, 1 APE or 1 HYP
step. This is the only case where we deviate from our convention to define $\nu$ via (29) and use di = ^L.
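The step from (11) to (12) can be verified numerically. The sketch below assumes the form $\cosh(\theta) = (1+\epsilon^2)/(1-\epsilon^2)$ for the right-hand side of (11), which is equivalent to $\tanh(\theta/2) = \epsilon$; the function name `theta_half` and the sample values of $\epsilon$ are chosen for illustration only.

```python
import math

def theta_half(eps):
    """Solve cosh(theta) = (1 + eps^2) / (1 - eps^2) for theta > 0 and return theta/2."""
    return 0.5 * math.acosh((1.0 + eps ** 2) / (1.0 - eps ** 2))

for eps in (0.02, 0.07, 0.2):
    exact = theta_half(eps)
    # atanh(eps) = eps + eps^3/3 + ... is the closed form of theta/2
    print(eps, exact, math.atanh(eps), exact - eps)
```

The difference between $\theta/2$ and $\epsilon$ is of order $\epsilon^3/3$, so for the inverse condition numbers relevant here the exponent in (10) is, to good accuracy, just $\epsilon$.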

As already mentioned in [28] the exponent $\theta/2$, defined via the spectral properties of the underlying
$|H_W|$, is a rather bad estimate for the actual localization $\nu$. The situation is not much better for the
filtered variety, as a brief comparison of our Tabs. 3 and 4 reveals^3. Though quantitatively unsuccessful,
this connection still gives a qualitative hint that the overlap operator with a filtered Wilson kernel
might enjoy better localization properties due to the reduced condition number of $H_W$. There are
more detailed bounds in the literature [30, 31, 32, 33, 34], but it seems fair to say that a quantitative
understanding of the localization of $D_{\rm ov}$ in terms of the spectral properties of $H_W$ is a challenge.

The localization $\nu$ as a function of the projection parameter $\rho$ is presented in Fig. 11. For $\beta =
5.84, 6.00$ the optimum parameter for the unfiltered operator is around $\rho = 1.6, 1.4$, respectively. For the
1 HYP operator the localization at $\rho = 1.0$ does not fall short of the maximal one by a large amount;
this is why we restrict much of our investigation with a filtered $D_{\rm ov}$ to the case $\rho = 1.0$. Still, the figure
suggests that an optimal $\rho$ for the 1 HYP filtered operator may be smaller than 1, and it decreases with
increasing $\beta$; at $\beta = 5.84$ we find $\rho_{\rm opt}^{\rm 1HYP} \simeq 1.0$ and at $\beta = 6.00$ we find $\rho_{\rm opt}^{\rm 1HYP} \simeq 0.8$.

After dealing with some technical issues to make sure that a $48^4$ lattice is large enough (see
App. B), we have studied $\nu$ defined via (29) as function of $\rho$ in the free case; the result is shown in
Fig. 12. The pattern observed in Fig. 11 should thus not come as a surprise, filtering simply drives the
locality properties of the overlap operator towards the free field case. In fact, Fig. 12 offers a simple
explanation why it is so difficult to predict the localization $\nu$ from spectral properties of the underlying
$|H_W|$ operator - in the free case the inverse condition number (6) in the range $0 < \rho < 1$ is monotonic,
while $\nu$ has a non-trivial extremum at $\rho_{\rm opt}^{\rm free} \simeq 0.54$.

^3 Tab. 3 contains the condition number on the subspace orthogonal to the 14 lowest modes, while $\epsilon$ in (11, 12) refers to
the full operator. For two reasons we propose to re-interpret (12) as a prediction for the locality of $D_{\rm ov}$ with $\epsilon$ the ratio of
the lower to the upper end of the bulk of eigenvalues of $|H_W|$. A practical hint is that the unprojected condition number
fluctuates wildly (see Fig. 6), whereas the localization is rather stable for all configurations in an ensemble. Furthermore,
in [28] it is shown that an isolated near-zero mode of $|H_W|$ does normally not affect the locality of $D_{\rm ov}$. Of course, one
cannot repeat that argument indefinitely, but still a test whether a modified $\epsilon$ helps is interesting.

Figure 12: Localization (29) in the free field case with an extremum at $\rho_{\rm opt}^{\rm free} \simeq 0.54$ and a steep descent
to the left. Note that in the $\rho$ range shown the inverse condition number $\epsilon = \rho/(8-\rho)$ is monotonic.



4 Kernel non-normality

An operator $A$ is called normal, if it commutes with its adjoint

$[A, A^\dagger] = 0$ (13)

which implies that its left and right eigenbasis coincide. Normality has special implications for lattice
Dirac operators. For a normal Dirac operator $D = \sum_k \lambda_k |k\rangle\langle k|$ which, in addition, is $\gamma_5$-hermitean

$\gamma_5 D \gamma_5 = D^\dagger$ (14)

we immediately obtain

$D^\dagger = \sum_k \lambda_k^* |k\rangle\langle k| = \sum_k \lambda_k \gamma_5 |k\rangle\langle k| \gamma_5$ (15)

and this implies that eigenmodes with real $\lambda_k$ are chiral (or may be linearly combined to chiral modes
in case of degeneracies). Furthermore, for such a $D$ the eigenvectors of the hermitean Dirac operator

$H = \gamma_5 D = \sum_k \lambda_k \gamma_5 |k\rangle\langle k|$ (16)

are given by $\sqrt{\lambda_k^*}\,|k\rangle \pm \sqrt{\lambda_k}\,\gamma_5 |k\rangle$ with the corresponding eigenvalues $\pm|\lambda_k|$.

The continuum Dirac operator is normal, and so are the naive and staggered discretizations (but
the latter two yield more than one flavor in the continuum limit). The GW relation together with
$\gamma_5$-hermiticity (14) also implies normality of the operator, hence $D_{\rm ov}$ is normal. In fact, the overlap
construction can be described as extracting the unique unitary part of $D_{\rm kern}/\rho$ [35], and for a normal
kernel it reduces to a simple radial projection of the $D_{\rm kern}/\rho$ eigenvalues onto the unit circle.

Figure 13: Non-normality (in lattice units) of the Wilson kernel, as defined in (17), as a function of
smearing. The improvement of the kernel normality does not seem to degrade towards the continuum.

The shifted Wilson operator, which we use as a kernel, is not normal. Some consequences of this
non-normality have been explored in other contexts [?]. Here, it suffices to point out that the relations
between the eigenmodes of the overlap operator, its kernel and the hermitean Dirac operator are not
as simple as above for the case of a normal operator. Typically, an eigenvector of the (hermitean)
kernel will mix into every mode of the overlap operator, which we expect to have a detrimental effect
on the efficiency of overlap construction algorithms. Thus, a practically relevant question is whether
UV filtering can reduce the amount of non-normality of the overlap kernel.

To quantify the non-normality of $D_{W,-1}$ we measure the 2-norm of the commutator; technically

$$\big\| [D_{W,-1}, D_{W,-1}^\dagger]\, |\eta\rangle \big\| \qquad (17)$$

is averaged over a number of normalized random vectors $|\eta\rangle$. In Fig. 13 the commutator (17) is shown for
all $\beta$ and smearing levels (since this is not a physical observable, we use lattice units). Evidently, any kind
of filtering reduces it - the filtered kernel is thus closer to normality and has left- and right-eigenvectors
that are better aligned than for the unfiltered version. Whether "smart" overlap construction algorithms
can be written which exploit this property is an open question.
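The stochastic estimate of the commutator norm needs only matrix-vector products, which is what makes it feasible for a lattice operator. Below is a minimal sketch on small dense matrices; the helper names, the Gaussian choice of random vectors and the two 2x2 test matrices are assumptions made for illustration and are not taken from the actual measurement.

```python
import math
import random

Matrix = list[list[complex]]
Vector = list[complex]


def dagger(a: Matrix) -> Matrix:
    """Hermitean conjugate of a square matrix."""
    n = len(a)
    return [[a[j][i].conjugate() for j in range(n)] for i in range(n)]


def matvec(a: Matrix, v: Vector) -> Vector:
    """Matrix-vector product a v."""
    return [sum(a_ij * v_j for a_ij, v_j in zip(row, v)) for row in a]


def norm(v: Vector) -> float:
    """Euclidean 2-norm of a complex vector."""
    return math.sqrt(sum(abs(x) ** 2 for x in v))


def commutator_norm(a: Matrix, n_vec: int = 16, seed: int = 1) -> float:
    """Average of ||[A, A^dagger] eta|| over normalized random vectors eta."""
    rng = random.Random(seed)
    a_dag = dagger(a)
    total = 0.0
    for _ in range(n_vec):
        eta = [complex(rng.gauss(0, 1), rng.gauss(0, 1)) for _ in a]
        scale = norm(eta)
        eta = [x / scale for x in eta]
        # [A, A^dagger] eta = A (A^dagger eta) - A^dagger (A eta)
        first = matvec(a, matvec(a_dag, eta))
        second = matvec(a_dag, matvec(a, eta))
        total += norm([x - y for x, y in zip(first, second)])
    return total / n_vec


normal_example = [[0, 1], [-1, 0]]   # anti-hermitean, hence normal
jordan_example = [[1, 1], [0, 1]]    # Jordan block, not normal
```

For the Jordan block the commutator is diag(1, -1), so the estimate equals 1 for every normalized vector, while it vanishes for the normal example.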

5 Physics perspectives 

To explore the physics potential of filtered overlap quarks a quenched spectroscopy study would be 
highly desirable. Physical results should reproduce - after a continuum extrapolation - results in the 
traditional "thin link" formulation. It would be interesting to see whether the speedup in point 1 of 
the introduction gets enhanced in points 2-4; in particular if scaling and/or asymptotic scaling set in 
earlier, this would make a real difference. Unfortunately, a detailed scaling study requires substantial
computational resources, but as a first step in this direction we want to investigate the renormalization
of the axial-vector current with filtered overlap quarks.

Figure 14: $m_1^{\rm AWI}+m_2^{\rm AWI}$ vs. the bare-mass sum $m_1+m_2$ [slope $= Z_A^{-1}$] at $\beta = 5.66$. Without filtering $Z_A<0$, while any
filtering prescription gives $Z_A>1$, with higher filtering levels resulting in a value closer to 1.

Table 5: $Z_A$ determined via (24) with various filtering prescriptions. On the last line the 1-loop result
[?] for the thin-link overlap, $Z_{V,A} = 1 + C_F\,0.198206\,g^2 + O(g^4) = 1 + 1.585648/\beta + O(1/\beta^2)$ at $\rho = 1.0$
and $Z_{V,A} = 1 + C_F\,0.090301\,g^2 + O(g^4) = 1 + 0.722408/\beta + O(1/\beta^2)$ at $\rho = 1.4$, is added for comparison.

We follow the method of [?], where one starts from the usual (chirally rotated) densities

$$P(x) = \bar\psi_1(x)\,\gamma_5\,\big[(1-\tfrac{1}{2\rho}D_{\rm ov})\psi_2\big](x) \qquad (18)$$

$$A_\mu(x) = \bar\psi_1(x)\,\gamma_\mu\gamma_5\,\big[(1-\tfrac{1}{2\rho}D_{\rm ov})\psi_2\big](x) \qquad (19)$$

with $\psi_1 \neq \psi_2$ (flavor non-singlet) and defines the correlators [$x = (\mathbf{x}, t)$]

$$G_{PP}(t) = \sum_{\mathbf{x}} P(\mathbf{x},t)\,P^c(\mathbf{0},0) \qquad (20)$$

$$G_{\nabla AP}(t) = \sum_{\mathbf{x}} \nabla_4 A_4(\mathbf{x},t)\,P^c(\mathbf{0},0) \qquad (21)$$

where $\nabla_4$ is the symmetric derivative in the time direction and $P^c$ is the conjugate of (18), i.e. with the
flavor indices $1 \leftrightarrow 2$ interchanged. With these correlators at hand one forms the ratio

$$\rho(t,m_1,m_2) = \frac{G_{\nabla AP}(t)}{G_{PP}(t)} \qquad (22)$$

where the second and third argument indicate that the spinors $\psi_1$ and $\psi_2$ in the densities (18, 19)
are solutions to the massive operators $D_{{\rm ov},m_1}$ and $D_{{\rm ov},m_2}$, respectively. On account of the axial Ward
identity (AWI) the ratio $\rho$ should be constant in time, and for light enough quarks (22) tends indeed to
plateau rather nicely (see e.g. Fig. 1 in [?]). In a slightly sloppy but transparent notation the plateau
value is $\rho(m_1,m_2)$. This quantity will - to the extent to which the AWI is respected at finite lattice
spacing - only depend on the sum$^*$ of the quark masses, and thus defines the quark masses

$$\rho(m_1,m_2) = \rho(m_1+m_2) + O(a^2) = m_1^{\rm AWI} + m_2^{\rm AWI} + O(a^2) . \qquad (23)$$



The actual data for our $Z_A$ determination for quenched filtered and unfiltered overlap quarks are
generated with couplings and geometries as given in the last line of Tab. ??. We restrict ourselves to
the canonical choice $\rho = 1$. We plot $\rho(m_1,m_2)$ versus $m_1 + m_2$ for various quark mass combinations
and filtering levels in Fig. 14 for $\beta = 5.66$ and in Fig. 15 for $\beta = 5.84, 6.00$, respectively. They form one
universal band, i.e. different $m_1$ and $m_2$ combinations with a fixed sum $m_1 + m_2$ always give the same
$\rho(m_1+m_2)$ [within errors]. Furthermore, the relationship is in good approximation linear, but there is
an anomaly without filtering at our strongest coupling (Fig. 14). Here, the slope is negative, and this
supports the view established in App. A that with $\beta = 5.66$ and $\rho = 1.0$ the projection point is "in" or
"to the left" of the physical branch of the underlying Wilson operator, and we effectively operate in the
"zero fermion" sector. At $\beta = 5.84$, too, the unfiltered plateau was not very pronounced, resulting
in a large systematic uncertainty beyond the statistical error quoted below. We use the ansatz

$$\rho(m_1 + m_2) = {\rm const} + \frac{1}{Z_A}\,(m_1+m_2) + {\rm const}\,(m_1+m_2)^2 \qquad (24)$$

and see whether we obtain acceptable fits and whether the first constant is consistent with zero. It
turns out that this is the case, and the associated $Z_A$ values are summarized in Tab. 5.
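The fit behind (24) can be sketched as an ordinary quadratic least-squares problem whose linear coefficient is $1/Z_A$. The snippet below is only an illustration under stated assumptions: the fit is unweighted (the real analysis would use statistical errors), the helper names are invented, and the input numbers are synthetic values generated with a known $Z_A$, not lattice data.

```python
def solve_linear(m, b):
    """Solve m x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    aug = [row[:] + [rhs] for row, rhs in zip(m, b)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for r in range(col + 1, n):
            factor = aug[r][col] / aug[col][col]
            for c in range(col, n + 1):
                aug[r][c] -= factor * aug[col][c]
    x = [0.0] * n
    for r in reversed(range(n)):
        s = sum(aug[r][c] * x[c] for c in range(r + 1, n))
        x[r] = (aug[r][n] - s) / aug[r][r]
    return x


def fit_quadratic(msum, rho_vals):
    """Unweighted least squares for rho = c0 + c1*M + c2*M^2; returns [c0, c1, c2]."""
    powers = [[1.0, m, m * m] for m in msum]
    normal = [[sum(p[i] * p[j] for p in powers) for j in range(3)] for i in range(3)]
    rhs = [sum(p[i] * y for p, y in zip(powers, rho_vals)) for i in range(3)]
    return solve_linear(normal, rhs)


# Synthetic plateau values (NOT lattice data), generated with Z_A = 1.25.
za_true = 1.25
msum = [0.02, 0.04, 0.06, 0.08, 0.10, 0.12]
rho_vals = [m / za_true + 0.3 * m * m for m in msum]

c0, c1, c2 = fit_quadratic(msum, rho_vals)
za_fit = 1.0 / c1   # the slope in (24) is 1/Z_A
```

In this toy setting the fit returns a vanishing constant term and recovers the input $Z_A$; a negative fitted slope would correspond to the anomaly seen without filtering at the strongest coupling.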

$^*$In principle, we might use the covariant conserved current for overlap quarks (see [40] and the 2nd work in [44]) with
the "thin" links replaced by "thick" links. Then the last term on the r.h.s. of (23, 24) would be absent, and the AWI
would be an exact identity. However, there is a practical problem with APE or HYP filtering, due to the SU(3) projection
involved. The solution via stout/EXP links is in exact analogy to the dynamical case discussed in the last section.

It is interesting to discuss both the general pattern of these $Z_A$ values and the relation to 1-loop
perturbation theory. Evidently, at fixed $\beta$ and $\rho$ the filtered $Z_A$ is much closer to the tree-level value 1.
We recover the relative strength ordering of Sect. 2, i.e. one APE step is less efficient than 3 APE or a
single HYP step, but the latter is topped by 3 HYP steps. At $\beta = 6.0$ we compare to $\rho = 1.4$, which is
the standard choice for the "thin link" overlap. Without filtering, $Z_A^{\rm none}(\rho = 1.4) \simeq 1.554$ is about half
of $Z_A^{\rm none}(\rho = 1.0) \simeq 3.145$, and this means that the choice $\rho = 1.4$ is not just near-optimal w.r.t. locality,
but also beneficial to tame (one particular) renormalization. Once the filtering recipe is specified, $Z_A$
seems to be monotonic in $6/\beta = g^2$, as expected from perturbation theory. In the unfiltered case the
1-loop value is included in the last line of Tab. 5 for comparison. Assuming that in perturbation theory
$1 < Z_A^{\rm 1HYP} < Z_A^{\rm none}$ holds for $\rho = 1$, one may compare the deviation of the unfiltered $\beta = 6.0$ operator,
$3.145 - 1.264 = 1.881$, to $1.153 - 1 = 0.153$, which then amounts to an upper bound in the 1 HYP case.
Evidently, the discrepancy is dramatically reduced, which in view of the perturbative results in [?]
should not come as a surprise. To get a slightly more quantitative view, we consider it useful to fit our
data without filtering at $\rho = 1$ to a Pade-type ansatz of the form

$$Z_A^{\rm none} = \frac{1 + c_1 x + c_2 x^2}{1 + (c_1 - 1.585648)\,x} \qquad (25)$$

with $x = 1/\beta$, where the perturbative knowledge [?] (cf. caption of Tab. 5) is built-in as a constraint.
In the same spirit a Pade ansatz for any of the filtered operators reads

$$Z_A^{\rm 1HYP/3HYP} = \frac{1 + c_1 x + c_2 x^2}{1 + c_3 x} \qquad (26)$$

with - as of now - no constraint on $c_1, \ldots, c_3$ yet. There is a problem with the functional forms (25, 26)
since our data sets contain 2 and 3 entries, respectively, and there are zero degrees of freedom. Still, for an
illustration such a "fit" might be worthwhile, and the result is shown in Fig. 16. With sufficient data
the curves would contain two pieces of information. The asymptotic slope for $x \to 0$ would predict the
perturbative 1-loop coefficients for $Z_A^{\rm 1HYP}$, $Z_A^{\rm 3HYP}$. And the pole in (25, 26), i.e. the values $c_1 - 1.585648$
or $c_3$, respectively, would predict the coupling where the perturbative description breaks down. Hence,
if the curves in Fig. 16 are indicative at all, it seems that filtering renders the perturbative 1-loop
coefficient of $Z_A$ much smaller, but the perturbative range gets barely enhanced.
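The two pieces of information carried by the constrained Pade form can be checked numerically. The sketch below assumes the ansatz $(1 + c_1 x + c_2 x^2)/(1 + (c_1 - 1.585648)\,x)$ with $x = 1/\beta$; the coefficients `c1`, `c2` are hypothetical values chosen only for illustration (they are not fit results), and the function names are invented.

```python
ONE_LOOP_RHO1 = 1.585648  # 1-loop coefficient of 1/beta at rho = 1 (caption of Tab. 5)


def z_one_loop(beta: float) -> float:
    """Thin-link 1-loop estimate Z = 1 + 1.585648/beta at rho = 1."""
    return 1.0 + ONE_LOOP_RHO1 / beta


def pade_constrained(x: float, c1: float, c2: float) -> float:
    """Constrained Pade form: the small-x slope is fixed to the 1-loop coefficient."""
    return (1.0 + c1 * x + c2 * x * x) / (1.0 + (c1 - ONE_LOOP_RHO1) * x)


def pole_position(c1: float) -> float:
    """Value of x = 1/beta at which the denominator vanishes."""
    return -1.0 / (c1 - ONE_LOOP_RHO1)


# Hypothetical coefficients, chosen only to illustrate the two properties.
c1, c2 = -4.0, 5.0

# Forward difference at x = 0: reproduces the built-in 1-loop slope.
h = 1e-6
slope_at_zero = (pade_constrained(h, c1, c2) - pade_constrained(0.0, c1, c2)) / h

x_pole = pole_position(c1)        # positive only if c1 < 1.585648
beta_breakdown = 1.0 / x_pole     # coupling at which the ansatz blows up
```

With these made-up coefficients the blow-up sits at $\beta \simeq 5.59$, and `z_one_loop(6.0)` reproduces the value 1.264 used in the comparison above.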
Figure 16: Pade functional forms through our $\rho = 1$ data in standard (left) or double logarithmic form
(right). In the unfiltered case the constraint to reproduce the known 1-loop behavior is built in and the
latter is indicated with a dotted line. It seems the 1-loop coefficients of the filtered $Z_A$ are dramatically
reduced, while the perturbative range (blow-up point of the Pade ansatz) gets barely enlarged.

6 Discussion


In this paper we have studied the massless overlap operator constructed from a filtered Wilson kernel
where the original "thin" links were replaced by "thick" links which behave in the same manner under
local gauge transformations. This is a legal change of the fermion discretization as long as one particular
filtering recipe [e.g. 1 HYP step with $\alpha_{\rm HYP} = (0.75, 0.6, 0.3)$] is maintained at all couplings. It amounts
to an $O(a^2)$ re-definition of $D_{\rm ov}$ at fixed $\rho$, as does a change of $\rho$ at fixed filtering level.

Our key observations are the following. First, the onset of the "bulk" part of the spectrum of the
underlying shifted hermitean Wilson operator $H_W = \gamma_5(D_W - \rho)$ gets lifted. This leads to an increased
inverse condition number $\epsilon$ (after projection typically by a factor 2-4 through a single HYP step) and the
latter reflects itself in a reduction (by the same factor) of the polynomial degree (and thus the number of
forward applications of $H_W^2$) needed to construct the inverse square root over the relevant range. The
precise impact on the CPU requirements for inverting the massive operator is a topic for future research.
Second, at standard couplings the filtered massless overlap is - even with the untuned canonical choice
$\rho = 1$ - better localized than the unfiltered version with an optimally tuned $\rho$ could ever be. Our finding
is backed by the observation that in the free case the optimum $\rho$ (w.r.t. locality) is around 0.54 and
thus substantially smaller than the typical $\rho \simeq 1.4$ used in the past. Our third observation is that the
filtered kernel is much closer to being a normal operator. In other words the left- and right-eigenvectors
of the kernel are better aligned with higher filtering level, and in this respect the effect of the "thick" links
is the same as a shift much closer towards the continuum under which the overlap construction (??)
tends to be a simple radial projection of the eigenvalues. Finally, our fourth observation is that
the renormalization constant of the axial-vector current is much closer to 1 with filtering than without.
We rate this as a sign that lattice perturbation theory for the filtered overlap might work much better
than for the unfiltered variety. If this is indeed so, and if it goes through for 4-fermion operators, it is
likely to be the most important consequence of our work, since it offers the perspective of considerably
reduced theoretical uncertainties in electroweak precision studies.

Let us finally discuss a variety of proposals in the literature that are similar in spirit to the one put 
forth in the present paper. 

There is a top-level version deriving from "parametrized fixed-point fermions". The idea behind this
approach pursued by Hasenfratz and Niedermayer is that true fixed-point fermions would satisfy the
GW relation exactly [6], but a practical implementation is always ultralocal. Hence, sticking such an
ansatz into the overlap formula (??) yields fermions with exact chiral symmetry and with other properties
that are at least as good as (and typically better than) those of the version with a plain Wilson kernel [?].

Bietenholz has considered a variety of actions, originally based on RG concepts [?]. The idea
was that an action with a spectrum close to the GW circle could be iteratively improved in its chiral
properties. Over time the focus has shifted towards using the overlap formula (??) to have exact chiral
symmetry, but it is clear that the kernel of his "hypercubic overlap" benefits from a larger inverse
condition number $\epsilon$ of $|\gamma_5 D_{\rm kern}|$ just as we do.

Gattringer and collaborators construct a "chirally improved" Dirac operator that involves the full
Dirac Clifford algebra with links restricted to the hypercube. The coefficients are adjusted such that
(for a given coupling) the violation of the GW relation is minimized [?]. The problem is the same as
in the Bietenholz approach: a single forward application with such a kernel is so expensive that the
improvement, if it is not "perfect", does not really pay off.

DeGrand has considered - both perturbatively and non-perturbatively - Wilson and clover action
varieties that involve smeared gauge links [17, 32]. Based on this experience he went on to construct a
"variant overlap" which starts from a kernel with only scalar/vector terms and smoothed links, and is
thus sufficiently cheap as to allow for sticking it into the overlap formula [43].

The closest to what we do is found in the work of Kovacs [29]. He uses a "fat-link clover" overlap
in which all links are smeared, together with the tree-level value $c_{\rm SW} = 1$. As far as we know, he was
the first author to notice that such a filtered kernel allows for the untuned choice $\rho = 1$, and still the
resulting overlap shows good localization properties.

A related approach has been pursued by the Adelaide group [?]. Their "fat link irrelevant clover"
overlap quarks are built from a clover action in which only the irrelevant pieces (i.e. the Wilson and the
Sheikholeslami-Wohlert terms) use smeared links, but not the covariant derivative. They found a similar
speedup factor in the construction of the overlap operator (cf. "step 1" in the introduction) and tied it
to the reduced spectral density of $|\gamma_5 D_{\rm kern}|$ near the origin.

Finally, "overlap" quarks with smeared gauge links have been used by several lattice collaborations.
RBC has found that the residual mass of domain-wall fermions at fixed extent of the fifth dimension gets reduced [50], though
they miss out an important ingredient, the projection to SU(3). UKQCD has used overlap valence
quarks with 3-fold HYP smeared links on staggered sea as supplied by the MILC collaboration, finding
a surprisingly good signal on as few as 10 configurations [51]. Similarly, LHP and NPLQCD have used
filtered domain-wall valence quarks on staggered sea to compute the pion form factor [52] and the $I = 2$
$\pi\pi$-scattering length [?], respectively.

There is another idea that should not be confused with filtering. Using an improved gauge action
has been found to reduce $\rho_{|H_W|}(0)$ by up to an order of magnitude [?]. There is, however, an
important practical difference to the filtering concept, which is a modification of the fermion action.
As already discussed in [?], a better choice of the gauge action improves, in the first place, the very
low end of the $|H_W|$ eigenvalue distribution. After projecting out the lowest $O(15)$ eigenvectors (which
nowadays is a standard thing to do [16]) much of the advantage is lost (in Fig. 3 of [56] the lifting factor
diminishes to the right). By contrast, filtering lifts the complete low-energy end of the $|H_W|$ eigenvalues
(in our Fig. ?? one finds an almost-universal lifting factor) and the usefulness of filtering is not vitiated
by the projection. Still, it might be interesting to see whether the two ideas can be fruitfully combined.

An extension of the filtering concept to full QCD is straightforward, albeit hampered by a technical
problem. These days, most dynamical fermion simulations are set up with a HMC algorithm, and the
latter requires the fermion action to be differentiable w.r.t. the gauge links. The kernel of our filtered
overlap quarks is differentiable w.r.t. the "thick" links, but not w.r.t. the elements of the original set,
due to the projection involved in the APE [8] or HYP [9] procedure. A convenient way out is offered
by the stout/EXP links introduced in [?], involving a differentiable mapping between the "thick"
and "thin" links. In pure gauge observables the usefulness of this smearing recipe was found to be
restricted to small parameter values [?], and one may fear that this feature persists in stout/EXP
overlap quarks, since in perturbation theory they are equivalent to APE filtered overlap fermions with
$\alpha_{\rm APE} = 1/(1 - 6\alpha_{\rm EXP})$ [?]. Thus, due to the pole at $\alpha_{\rm EXP} = 1/6$ we expect them to have a "narrow
therapeutic range" in parameter space, but it is clear that there is no conceptual issue in simulating
full QCD with filtered overlap quarks beyond the difficulties met in the unfiltered case.

To summarize, our suggestion is to use the overlap recipe (??) with an unimproved ($c_{\rm SW} = 0$) Wilson
kernel in which all links are replaced by some smeared descendants of the actual gauge background.
We recommend staying with a moderate amount of link "fattening", e.g. with a single step of standard
HYP smearing [9]. The projection parameter $\rho$ may be fixed at its canonical value 1, and in this sense
the filtered overlap involves less tuning than the unfiltered version$^{**}$. An important restriction is that
the choice of iteration level and smearing parameter must be the same for all couplings considered in
a scaling study. This is one point on which our proposal differs from some of the attempts reviewed
above which involve coefficients (e.g. in the extended $\gamma$-algebra) that are adjusted "by hand" to yield a
GW-type spectrum at one standard value of the gauge coupling. The other difference is that our kernel
remains cheap and still requires fewer $D_W^\dagger D_W$ forward applications. From a practical viewpoint,
a clear advantage is the ease of implementation of the "filtered overlap": everyone with a running
overlap code has it (in disguise).

$^{**}$Of course, there are parameters in the filtering recipe, but our results show that they hardly matter. Thus filtering
allows one to trade a parameter that needs to be tuned for parameters on which the lattice data show very little sensitivity.

App. A: Cumulative eigenvalue distributions in 4D and 2D

In this appendix we discuss what can be learned from the cumulative eigenvalue distribution (CED). We
consider both the eigenvalues in 4D generated for the main part of this paper and data from dedicated
runs in the quenched Schwinger model (QED with massless fermions in 2D) to elucidate the effects that
filtering and changing $\beta$ have on the spectral density of the hermitean Wilson operator $H_W = \gamma_5 D_{W,-1}$.

Fig. 17 presents the cumulative eigenvalue distribution (CED) of the 15 smallest eigenvalues of
$|H_W|$ on the ensembles discussed before. We show it both in standard form and in double logarithmic
form, and the scale on the ordinate follows from the requirement that it would extend up to 1, if all
eigenvalues were calculated (cf. Fig. 18 below). For the two intermediate couplings ($\beta = 5.84, 6.00$) we
see the expected linear rise of the CED near the origin, which soon gets complemented by a higher order
piece. The coefficient of the linear part is a measure for the spectral density of the hermitean Wilson
operator at the origin, $\rho_{|H_W|}(0)$. That density being non-zero means that there is a finite probability to
encounter arbitrarily small eigenvalues. The main effect of smearing is to reduce this spectral density,
as is evident from the double logarithmic plots - here the initial slope 1 piece gets shifted downwards,
and this corresponds to a smaller coefficient in front of the linear piece in the standard representation.
Our data at $\beta = 6.26$ are of lesser quality - here we definitely cannot identify a linearly dominated
regime. The situation is far more favorable in Fig. 18 where quenched Schwinger model data are shown.
Apart from the statistics, the main difference is that all eigenvalues (extending up to $\simeq 3$ in 2D) are
included. The higher the filtering level or $\beta$, the more pronounced is the "jump" in the CED at $\lambda \simeq 1$.
Note that with a chiral kernel all eigenvalues of $|\gamma_5 D_{\rm kern}|$ would be there, i.e. the CED would be a step
function at $\lambda = 1$. Finally, to come back to Fig. 17, the situation at the strongest coupling ($\beta = 5.66$)
is different, since here the linear piece in the unfiltered CED is not larger than in the filtered versions.
This is because our choice $\rho = 1$ lets us "lose" the fermion - at this coupling our projection point is
somewhere "in" the physical branch or "to the left" of it, while for the filtered version $\rho = 1$ is still
appropriate. One might avoid such a situation by choosing a larger $\rho$ with the unfiltered kernel, but an
even safer option might be to refrain from simulating unfiltered overlap quarks on such coarse lattices.
It looks like this is a situation where the filtered overlap may help a lot, since it allows simulations on
coarser lattices than the unfiltered operator, but in order to really be useful such simulations should be
in the scaling regime (and not just in the right universality class), and this is, of course, not yet clear.
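How the density at the origin follows from the initial slope of the CED can be illustrated with a short sketch. It assumes that the CED is normalized by the total number of eigenvalues in the ensemble (so that it would reach 1 if all eigenvalues were included) and that the slope is taken from a straight-line fit through the origin; the function names and the synthetic low modes are illustrative and are not lattice data.

```python
from bisect import bisect_right


def ced(eigs_sorted, n_total, lam):
    """Fraction of all n_total eigenvalues that are <= lam."""
    return bisect_right(eigs_sorted, lam) / n_total


def density_at_origin(eigs_sorted, n_total, lam_max):
    """Slope of a straight line through the origin fitted to the CED for lam <= lam_max."""
    pts = [(lam, (i + 1) / n_total) for i, lam in enumerate(eigs_sorted) if lam <= lam_max]
    return sum(lam * c for lam, c in pts) / sum(lam * lam for lam, _ in pts)


# Synthetic low modes (NOT lattice data): uniform density rho0 near the origin.
rho0 = 0.05
n_total = 10_000   # total number of eigenvalues in the ensemble, computed or not
low_modes = [(i + 1) / (rho0 * n_total) for i in range(50)]

rho_est = density_at_origin(low_modes, n_total, lam_max=0.1)
```

In a double logarithmic plot a smaller `rho_est` shows up as a downward shift of the slope-1 piece, which is the effect attributed to smearing above.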

Figure 17: Cumulative eigenvalue distribution (CED) of $|H_W|$ in standard (left) or double logarithmic
(right) form for $\beta = 5.66, 5.84, 6.00$ (from top to bottom) with 0, 1, 3 steps of APE or HYP filtering. At
each $\beta$ the upper cut in the vertical direction is the same on the left and on the right and the upper end
in the horizontal direction is 0.2 throughout. Note the change in the ordinate scale between different
couplings. At $\beta = 6.26$ we definitely lack the statistics needed to see a linearly dominated regime.

Figure 18: Log-log plot of the cumulative eigenvalue distribution (CED) of $|H_W|$ in the Schwinger model
($N_f = 0$, $16^2$ geometry, $\beta = 3.2$, $\rho = 1$) without filtering (top curve) and after 1 or 3 filtering steps. In all
cases the CED starts out linearly, rises sharply somewhere near $\lambda \simeq 1$ and reaches 1 at $\lambda \simeq 3$ (in 2D).
This figure includes all eigenvalues of all three operators on 22,500 decorrelated configurations.

The spectral properties of $H_W$ play a role in the context of the physical interpretation of the Aoki
phase. The latter is a conjectured phase, originally specific to $N_f = 2$ active Wilson fermions at
negative mass, in which after switching off an external trigger term

$$S_{\rm source} = i h\, \bar\psi \gamma_5 \sigma_3 \psi \qquad (27)$$

parity and flavor break spontaneously and a condensate (const $\neq 0$)

$$\lim_{h \to 0} \langle \bar\psi \gamma_5 \sigma_3 \psi \rangle = i\,{\rm const} \qquad (28)$$

forms. Good numerical evidence for a non-zero condensate (28) in the (dynamical) 2-flavor case for an
appropriate choice of the negative mass $-\rho(\beta)$ is found in [?]. Ref. [?] argues that in the massless limit
of the continuum theory a condensate of the form (28) is simply an axial rotation of the usual (flavor
diagonal) condensate and thus breaks neither parity nor flavor. They relate the spectral density of the Dirac operator
to that of $H_W$ and argue that the absence of a gap (around the origin) of the latter is indicative of chiral
symmetry breaking and that $\rho_{|H_W|}(0) > 0$ if and only if the condensate (28) is non-zero. This was later elucidated to
be a continuum argument which - in view of our Sec. 4 - might be an important point.

The next issue is whether there is an Aoki phase in the quenched theory with 2 valence (but no sea)
flavors [?]. The simplest expectation is that qualitatively the picture with the 5 Aoki "fingers"
goes through, though the phase boundary is somewhat shifted w.r.t. the $N_f = 2$ case.

Our 4D data in Fig. 17 clearly show the suppression of $\rho_{|H_W|}(0)$ as one approaches the continuum,
but we cannot see any sign that this distribution would vanish at some "critical" coupling. Given the
uniform pattern in the figures (apart from the scale on the $y$-axis they seem qualitatively similar), it
seems more likely to us that $\rho_{|H_W|}(0)$ will stay non-zero for arbitrary couplings.

To test this view, we analyze the quenched Schwinger model where high statistics can be reached.
The couplings and geometries are chosen such as to have a fixed physical volume, with a box size about
5 times larger than the Compton wavelength of the lightest degree of freedom in the chiral limit of the
$N_f = 1$ theory. A survey of the parameters is given in Tab. 6 and for technical details we refer to [?].
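The matching condition can be checked against the five ($L$, $\beta$) pairs that appear in the panel titles of the Schwinger model figures. The sketch below rests on two assumptions that are not spelled out in the text: the usual 2D convention $\beta = 1/(ae)^2$, and the $N_f = 1$ Schwinger mass $e/\sqrt{\pi}$ for the lightest state in the chiral limit. The function name is illustrative.

```python
import math

# (L, beta) pairs as they appear in the panel titles of the Schwinger model figures.
lattices = [(8, 0.80), (12, 1.80), (16, 3.20), (20, 5.00), (24, 7.20)]


def box_in_compton_wavelengths(extent: int, beta: float) -> float:
    """Box length times the N_f = 1 Schwinger mass e/sqrt(pi), using a*e = 1/sqrt(beta)."""
    return extent / math.sqrt(math.pi * beta)


ratios = [box_in_compton_wavelengths(extent, beta) for extent, beta in lattices]
```

Under these assumptions all five lattices share $L^2/\beta = 80$, i.e. the same box size of about 5.05 Compton wavelengths, in line with the "about 5 times" quoted above.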

Fig. 19 provides an overview of the complete $|H_W|$ eigenvalue distribution; one sees a "peak"
at $\lambda = 1$ forming that gets more pronounced with higher $\beta$ and higher filtering level. This "peak"
corresponds to the "jump" at $\lambda = 1$ in the CED of $|H_W|$ in Fig. 18.

Fig. 20 presents the distribution of the lowest eigenvalue of $|H_W|$. At low $\beta$ this distribution accumulates
at zero, at intermediate values of the coupling there is a horizontal band of eigenvalues connecting
down to zero, and at the largest $\beta$ there are just scattered eigenvalues. Evidently, one cannot draw a
final conclusion whether these scattered eigenvalues really make up for a non-zero $\rho(0)$, but it seems
worthwhile to study this band in the region of $\beta$ values where it is clearly visible and see whether
changing $\beta$ implies some structure, or whether it just stays flat, regardless of $\beta$.

Figure 19: Distribution of all eigenvalues of $|H_W| = |\gamma_5 D_{W,-1}|$ at five $\beta$ and 0, 1, 3 smearings.
[Panels: 8x8 at $\beta = 0.80$, 12x12 at $\beta = 1.80$, 16x16 at $\beta = 3.20$, 20x20 at $\beta = 5.00$, 24x24 at $\beta = 7.20$, each with nsmear = 0, 1, 3; horizontal-axis ticks from 0.5 to 3.]

Figure 20: Distribution of the 1st eigenvalue of $|H_W| = |\gamma_5 D_{W,-1}|$ at five $\beta$, with 0, 1, 3 smearings.
[Panels: same lattices and smearing levels as in Figure 19; horizontal-axis ticks from 0.2 to 1.]

Figure 21: Cumulative eigenvalue distribution of $|H_W| = |\gamma_5 D_{W,-1}|$ at five $\beta$, with 0, 1, 3 smearings.
[Surviving panel titles: 8x8 at $\beta = 0.80$ and 12x12 at $\beta = 1.80$ (cum. eigenval. distr.), 20x20 at $\beta = 5.00$ (CED from 10 low. eig.); horizontal-axis ticks at 0.2 and 0.4.]

Figure 22: $\rho_{|H_W|}(0)$ versus $\beta$ in the quenched Schwinger model. Filtering seems to reduce, for all $\beta$, the
eigenvalue density by a near-universal factor. The data suggest an exponential fall-off at large $\beta$.

Fig. 21 presents the CED in the area of interest, the very-low $\lambda$ region. At $\beta = 5.0, 7.2$ we show
the data from the high-statistics run with 10 eigenvalues per configuration (bottom line of Tab. 6),
but we checked that the results are consistent with what we get from the runs where all eigenvalues
were determined. At a given coupling filtering clearly reduces $\rho_{|H_W|}(0)$. The overall impression is that
changing $\beta$ merely rescales the $y$-axis, in striking analogy with what we have seen in 4D (Fig. 17). If
this is indeed true, the natural conclusion is that $\rho_{|H_W|}(0) > 0$ at any finite $\beta$ in the quenched theory.

Fig. 22 contains a summary of our determinations of the spectral densities $\rho_{|H_W|}(0)$ at various $\beta$ and
filtering levels, extracted from the initial slopes in the CED shown in Fig. 21. It looks like eventually
the density decreases exponentially in $\beta$ and changing the filtering level amounts to an overall rescaling
factor which is, in good approximation, independent of $\beta$. Obviously, this is just numerical evidence,
but the message seems to be as clear as one can possibly hope for from a numerical experiment. Note
that for practical reasons we cannot take the infinite volume limit, but given our physical box size we
expect finite volume effects to be exponentially small.

Table 6: Matched quenched 2D lattices; after reweighting to $N_f = 1$ the lightest particle in the chiral limit
would fit 5 times into the box. In the first set all $|H_W|$ eigenvalues are determined with statistics such as
to have an equal number of eigenvalues. In the second set only the lowest 10 eigenvalues are computed.

We remind the reader that (both in 2D and in 4D) we were working at fixed negative mass $m_0 = -1$
and pushed towards the continuum line. In other words, we were trying to stay as far outside the Aoki
phase as one can, if one wants to be in the supercritical region with one overlap fermion. Of course, our
data do not exclude the existence of a critical $\beta$, but they favor the view that there is no $\beta_{\rm crit}$ that makes
the $|H_{W,-1}|$ spectral density strictly zero and therefore we conjecture that $\rho_{|H_W|}(0) > 0$ throughout the
supercritical region. Still, we do not see why this would create a problem for the localization of the
overlap operator, since the two do not seem to be inversely connected one-to-one.

Finally, there is a simple argument that $\rho_{|H_W|}(0) > 0$ holds in all quenched or unquenched theories
with a massive overlap determinant at all couplings. Acquire infinite statistics at $\beta = 0$, $N_f = 0$. When
integrating over the full configuration space the spectral density is certainly non-zero. Results for the
case of interest at finite $\beta$ and maybe finite $N_f$ can be obtained through reweighting. As long as one
can guarantee that there is no configuration where the reweighting factor vanishes, the spectral density
will be modified, but it cannot be made strictly zero. This holds true in the quenched case and in
the dynamical theory with an overlap determinant ($m \neq 0, -2\rho$), but it would not be true with Wilson
fermions at a negative mass.

App. B: Overlap operator locality in the free case 

In this appendix we collect some technical points to make sure that a numerical investigation of the
localization $\nu$ versus $\rho$ as shown in Fig. ?? for a $48^4$ lattice is not overwhelmed by finite size effects.

Considering $f(d_1)$ as given in (??) on a finite lattice one encounters a technical problem that
is evident in Fig. 23. The free field case is far from showing rotational symmetry and the supremum
function $f$ has a couple of initial bumps (in particular at $d_1 = 4, 8, 12$), and this means that one needs
to go to sufficiently large distances to measure the slope in a logarithmic representation. On the other
hand, the choice to measure the distance in the 1-norm leads to rather large finite size effects for $d_1 > L$,
in particular the region near the maximal distance $d_1 = 2L$ is heavily contaminated. Therefore, we tried

$$\nu = \frac{1}{2}\log\big(f(L-1)/f(L+1)\big) \qquad (29)$$

as a technical definition of the localization $\nu$ in (??). The comparison between the $32^4$ and $48^4$ geometries
shows that our choice to evaluate the logarithmic derivative at $d_1 = L$ produces rather consistent $\nu$ values,
and we take this as a sign that they cannot be far from the asymptotic exponent. For the (two-digit-precision)
projection parameter that we find to be optimal w.r.t. locality in the free case, $\rho^{\rm opt} \simeq 0.54$,
the correlator is explicitly shown to be steeper than in the $\rho = 1$ case.
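The technical definition (29) is a symmetric logarithmic derivative evaluated at $d_1 = L$. A minimal sketch, using a toy exponential instead of the free overlap correlator (the function names, the prefactor and the value of `nu_true` are illustrative assumptions):

```python
import math
from typing import Callable


def localization_exponent(f: Callable[[int], float], extent: int) -> float:
    """Symmetric logarithmic derivative of f at 1-norm distance d_1 = L, as in (29)."""
    return 0.5 * math.log(f(extent - 1) / f(extent + 1))


# Toy correlator (NOT the free overlap data): a pure exponential decay.
nu_true = 0.7


def toy_f(d1: int) -> float:
    return 2.5 * math.exp(-nu_true * d1)


nu_32 = localization_exponent(toy_f, 32)
nu_48 = localization_exponent(toy_f, 48)
```

For a pure exponential both lattice sizes return the input exponent exactly; for real data the agreement between the two sizes serves as the consistency check described above.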
Figure 23: Free overlap couplings in four directions and the function $f$ together with their "effective
masses" for ($32^4$, $\rho = 1$) and ($48^4$, $\rho = 1$). Some effective masses are missing, since correlator values
below the double precision limit have been cut off. The good agreement of the $32^4$ and $48^4$ data
with evaluation point $d_1 = L$ (dotted vertical lines) suggests that these data are much less affected by
finite size effects than those near the maximal $d_1 = 2L$. The correlator with $\rho = 0.54$ is visibly steeper.